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T 1 Abstract 

M. Smorodinsky and S. Laurent have initiated the study of the filtra- 
i-^H tions of split-word processes, in the framework of discrete negative time. 

For these nitrations, we show that Laurent's sufficient condition for non 
standardness is also necessary, thus yielding a practical standardness crite- 
rion. In turn, this criterion enables us to exhibit a non standard filtration 
£NJ which becomes standard when time is accelerated by omitting infinitely 

many instants of time. 

m 

1 Introduction 

o 

We shall be interested in nitrations, in the setting of discrete, negative time: given 
a probability space (Q,A,F), a filtration is an increasing family J 7 = (J r n ) n ^o 
of sub-u-fields of A; observe that the time n ranges over all negative integers. 
( Equivalent ly, one could consider decreasing families of a-fields indexed by pos- 
itive integers, known as reverse nitrations; but we find it more convenient to let 
time run forward, with n + 1 posterior to n, at the mild cost of dealing with 
negative instants.) As discovered by Vershik [TO], in this framework very subtle 
phenomena occur in the vicinity of time — oo. 

For a simple example, suppose that the u-field n n .F n is degenerate and that, 
for each n, J- n is generated by T n -\ and by some Bernoulli random variable 
U n which is independent of T n -i and uniformly distributed on the 2-set {0, 1}. 
Under these hypotheses, it may happen that J 7 contains more information than 
the natural filtration of the Bernoulli process U = (U n ) n ^o (this is similar to 
weak solutions in SDEs); but something more surprising is also possible: that J 7 
is not generated by any Bernoulli process whatsoever. Such nitrations have been 
called non standard by Vershik, who has given in [10] a necessary and sufficient 
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criterion for standardness, and several examples of non standard nitrations. The 
rigorous definition of a standard nitration will be recalled later, in section |4j 

All nitrations considered in this study have an additional property: for each n, 
T n is generated by T n -\ and by some random variable U n which is independent 
from J- n -i and uniformly distributed on some finite set with r n elements. Such 
a filtration is called (r n )-adic. For these nitrations, as shown by Vershik [10J, 
standardness turns out to be tantamount to a simpler, much more intuitive prop- 
erty: an (r n )-adic filtration J 7 is standard if and only if J 7 is of product type, 
that is, T is the natural filtration of some process V = (V n ) n ^o where the V n are 
independent random variables (in this case, it is easy to see that the process V 
can be chosen with the same law as U.) So, at first reading, 'standard' can be 
replaced with 'of product type' in this introduction. 

When time is accelerated by extracting a subsequence, that is, when (J r n ) n6 z- 
is replaced with (J r n ) ne g where Q is some infinite subset of the time-axis Z~, 
a standard filtration always remains standard, but a non standard one may be- 
come standard (or not). Examples of this phenomenon were first studied by 
Vershik in the framework of ergodic theory, and then, in a probabilistic setting, 
by Laurent [8]. Lacunary isomorphism theorem [5j states that, from any filtration 
(J r n)n£Z~ (such that jFq is essentially separable), there exists Q C Z~ such that 
(J r n)n£Q is standard. 

By varying the parameters in an example initially due to Vershik [10] and later 
modified (in the dyadic case) by M. Smorodinsky |6J, Laurent [E] has described 
a family of nitrations, the split-word nitrations; he has shown some of them (the 
fastest ones) to be standard, and some other ones (the slowest ones) to be non 
standard; but an intermediate class was left undecided. We continue his study, 
and show that all these intermediate nitrations are in fact standard. This yields 
an easily verifiable necessary and sufficient condition for a split-word filtration to 
be standard. 

As the family of split-word nitrations is stable by extracting subsequences, this 
criterion makes it simple to observe on these examples the transition from non- 
standardness to standardness when time is accelerated. We find that this transi- 
tion is, in some sense, sharp: in Example [2j we exhibit a non standard filtration 
T such that, for every infinite subset Q of Z~ with infinite complementary, the 
corresponding extracted filtration (J-" n ) ne Q is standard. This J 7 is as close to 
being standard as possible, for, if Q C Z~ is cofinite and if Q is any filtration, 
the extracted filtration {Q n )n^Q clearly has the same asymptotic properties (stan- 
dardness, product type, etc.) as Q. To our knowledge, in the earlier literature, 
the best result in this direction was the existence of a non standard filtration T 
such that (J"2n)n^o is standard (examples are given by Vershik [10], Gorbulsky [7] 
and Tsirelson in an unpublished paper). 

In this paper we study the nitrations of split-word processes. These processes 
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are inspired by examples given by Vershik [TOl example 2,3,4] and have been 
introduced and studied in terms of probability theory by Smorodinsky [6] in the 
dyadic case and by Laurent [8] in the general case. 

The distribution of a split-word process depends on an alphabet A of size N ^ 2 
and on a sequence of positive integers (AJn<o such that £ = 1 and, for every 
n ^ 0, the ratio r n = £ n _i/£ n is an integer r n ^ 2. The sequence (X n ) n ^ of split 
words is indexed by the nonpositive integers. For every n ^ 0, the law of X n is 
uniform on the set of words of length i n on A. Moreover, if one splits the word 
X n _i, whose length is £ n -i, into r n subwords of length £ n , then the word X n is 
chosen uniformly among these subwords, independently of everything up to time 
n — 1. More precisely, denote by V n the location of the subword X n in X n _ x . 
Then V n is uniform in {1,2,..., r n }. The split-word process is (X n , V n ) n ^Q. 

Call T x ' v = (J rX ' v ) n ^o the natural filtration of (X n , V n ) n ^Q. Clearly, every sub- 
sequence [T x ,v )neQ with Q C Z~ is the natural filtration of a split-word process 
with lengths (C/d)neQ on the alphabet A £m , where m = maxQ. 

The filtration J 7X,V is (r ra )-adic since for every n ^ 0, 

' v = J^n-i V cr(V n ) with V n independent of J~^li- 

Moreover, the tail cr-field J 7 ^ is trivial, thanks to proposition 6.2.1 in [8]. Yet, 
the inclusion J 7 ^ C T X ' V is clearly strict since X is independent of (V^) n ^ - 
However, the filtration J 7X ' V may still be a product type filtration (generated by 
some other independent sequence). 

1.1 Results 

Surprisingly, the nature of the filtration T X ' V depends on the sequence (£ n ) n ^o- 

Theorem 1 The filtration J 7X,V is standard ( or equivalently, is of product type ) 
if and only if, the series — — — diverges. 

n 

The 'if part of the theorem, which is new, will be proved in section [3j and the 
'only if part in section |4} 

Note that the convergence of the series — is equivalent to the condition 
A of Laurent [8] , who established the 'only if part of theorem [I] 
The condition that we call ->A (the divergence of the series — which 
implies standardness, improves on Laurent's sufficient condition of standardness: 
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(Vat) There exists a < 1 such that r" 3> N £n as n — > — oo. 

Laurent notices that conditions Vat are weaker than the condition 

— > oo as n — > — oo, (V) 

which does not depend on N. Laurent indicates that conditions V and A were 
previously introduced by Vershik [10, example 1], in the context of decreasing 
sequences of measurable partitions. 

Since conditions V and A do not exhaust all possible situations, both Vershik 
and Laurent asked what happens "between V and A" . In theorem [TJ we solve 
Laurent's question: condition A is in fact necessary and sufficient for the filtration 
of the split-word process to be non standard. 

After the completion of this paper, Anatoly Vershik drew our attention to the 
paper [13], where Heicklen obtains a result equivalent to Theorem [TJ The (V n )- 
adic nitrations studied by Heicklen were introduced by Vershik [TJ] as follows. 
(We mention that both Vershik and Heicklen index sequences by the set of non- 
negative integers whereas we index them by the set of nonpositive integers, and 
that this is the only difference between their presentation and ours, given below.) 

Let A be a finite alphabet and (G n )n^o the decreasing sequence of groups defined 
by 

o 

G n = Z /^ Z - 

k=n+l 

Let G denote the union over n ^ of the groups G n . For every n ^ 0, the group 
G n acts on A G (on the left) by canonical shifts. Namely, for every g G G n and 
/ G A G , one defines g ■ f G A G by (g ■ f)(x) = f(xg) for every x G G. Let 
Orbcr n (/) denote the orbit of a given / G A G under the action of G n , that is, 
Orb G „(/) = {g ■ f ; g G G n }. 

Let F = {F(x)) x£ g £ A G be a random function whose coordinates F(x) are 
independent and uniformly distributed in A. The filtration (J r n )„^o studied by 
Vershik and Heicklen is the natural filtration of (O n ) ns go ; where 

O n = Orb G „(F). 

For each n ^ 0, 0„_i is the union of r n orbits under the action of G n , one orbit for 
each element of G n -\/G n . Almost surely, these orbits are all different since the 
shifted functions g-F are different. Futhermore, conditionally on (0„_i, O n _2, ■■■), 
the random variable O n is uniformly distributed on these r n orbits. This shows 
that (F n ) n <^o is an (r n )-adic filtration. 

One can show that the tail cr-field J-~oc is trivial and that the filtration of the 
split-words process is immersed in (J r ra ) ns go- Informally, the word X n at time n 
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is given by the values at e (the identity of the group G) of the elements of O n . 
One gets X n from X n _ x by splitting the orbits under G n _i into r n orbits under 
G n and by choosing one of these orbits uniformly randomly. 

Thus, the standardness of (J r n )n^o when condition A fails implies that the natural 
filtration of the split- words process is standard and, therefore, that it is of product 
type. Heicklen's proof relies on Vershik's standardness criterion and uses the 
language of ergodic theory. Although Heicklen's result and our Theorem [T] are 
logically equivalent, we believe that our proof is interesting because it relies on a 
constructive, direct and probabilistic method. 

This result has interesting applications in ergodic theory as we now explain. Re- 
call that entropy is a well known invariant associated to an automorphism of a 
probability space (that is, a bimeasurable application preserving the measure). 
Vershik [3] defined a much more elaborate invariant, named the scale. But com- 
puting this invariant is a very difficult task, even in simple cases. However, 
Laurent showed that our result provides the exact scale of a dyadic transforma- 
tion [2]: more precisely, the scale of this transformation is the set of sequences 
(^n)n^o fulfilling condition A. 

Using Vershik's theory, one can deduce from theorem [T] that the filtration of 
the split-word process on any separable alphabet (endowed with an arbitrary 
measure) is standard under condition -A, see [UJ for a proof. 

1.2 Examples 

Condition -A forces the length £ n to grow very quickly as n goes to — oo. If a 
given sequence (^ n ) n ^o is ~~A, then every sequence (£' n )n^o such that (\nr' n )/£' n ^ 
(lnr n )/£ n is -A as well. No similar property holds for the sequence (£ n )n^o 
only, nor for the sequence (r n )„^ only. Indeed, the first example of this section 
provides a sequence (r n ) n which is -A and such that (r^) n ^ is A. 

Example 1 (A standard split-word filtration) Let £ = 1 and £ n -\ = 2 £n 
for every n ^ 0. That is to say 



n 



2 



where the figure 2 appears \n\ times. 



Then for every n < 0, 




lin — ln+1 



Therefore 



log 2 ( r n)/^ 



n 




^1, 



which proves that (r n ) is -A. 
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Theorem [T] has another interesting consequence which we now explain. Recall 
that, by the lacunary isomorphism theorem [5], from any filtration (J- n )n^o (such 
that J-q is essentially separable), one can extract a filtration {J^^neQ which is 
standard. In [TP] , Vershik provides an example where (J r ri ) ns go is non standard 
whereas (J-2n)nsgo is standard. In [7J, Gorbulsky also gives such an example. 
Theorem [I] provides an example of a non standard filtration (example [2] below) in 
which the transition from the non standard case to the standard case is very sharp: 
(J r n ) ne Q is standard for every infinite subset Q of Z~ with infinite complementary. 



Example 2 (A non standard filtration close to standardness) Set£ = 1 
and £ n _i = 4^ for every n ^ 0. That is to say 

2 

£ n = 4 22 where the figure 2 appears \n\ — 1 times. 

Then the filtration [J-^' ) n <o is not of product type. Yet, if cf) is a strictly 
increasing application from — N into — N such that 4>(n) — n — > — oo as n — > — oo, 
then the filtration {J~^p)n^o is of product type. 



Proof. On the one hand 



log 2 r n log 2 4_i _ 2 



£n £n \f£~n 



which is the general term of a convergent series, thus (£ n ) n ^o is A. On the other 
hand the filtration (^(^)n<o is the filtration of a split-word process of length 
process (£' n ) n ^o = (^(n))n<o- The ratios between successive lengths are, for n ^ 0, 

j _ / // _ 



If 4>(n) — n — > — oo when n — > — oo, then <p{n — 1) ^ 0(^) — 2 infinitely often. For 
these n, 



0(n) 



^■<f>(n) "<f>(n) *-0(n) 

hence 

log 2 r; ^ 2 2V^ log 2 ^ ( n) 

£' n £(f>(n) @(j>(ri) 

This shows that a subsequence of (log 2 (r^)/£^) n converges to infinity, hence that 
(On is -A. ~ ' □ 
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2 Laurent's method and tools 



In this section, we introduce the tools used by Laurent to prove that under 
condition V, the filtration of the split-word process is of product type. Laurent 
used a canonical coupling to build explicitly a sequence of innovations (V^) n ^o 
which generates the process (X n , V n ) n ^ . 

Definition 1 If {F^n^o a filtration, and (U n ) n ^Q is a sequence of random 
variables such that for every n ^ ; 

J~n — J~n-x V o-{U n ) with U n independent of T n -\, 

one says that (U n ) n ^o is a sequence of innovations for (J r n ) ns go- 

This method is strengthened in section |3j where we consider a partial canonical 
coupling to improve on condition V. 

We remind the reader that sequences are indexed by the nonpositive integers. 
2.1 Change of innovations 

We start with a complete definition of split-word processes. 

Definition 2 (Split-word process) Let (r n ) n ^o denote a sequence of integers 
such that r n ^ 2 for every n ^ 0. Set £ = 1 and, for every n ^ ; £ n _i = r n £ n . 
Let A denote a finite set, called the alphabet, with cardinal N ^ 2. 

A split-word process is any process (X n , V n ) n ^o such that, for every n ^ 0, 

• X n is uniformly distributed on A ln , 

• V n is uniformly distributed on {1, ... , r n } and independent of the a-algebra 
Fn-i = cr(X m , V m ; m ^ n - 1), 

• if the word X n _i (with length / n _i = l n r n ) is partitioned into r n subwords 
of length l n , X n is the V n -th among those r n subwords. 

The sequence (X n ) n ^ is a inhomogeneous Markov process indexed by the nega- 
tive integers and generated by the innovations (V n ) n ^o. The existence of such a 
process (X n , V„) n ^o is guaranteed by Kolmogorov's theorem. 

To prove that, under some conditions, the filtration of the split-word process 
is of product type, one has to switch from one set of innovations to another. 
Lemma 2.1| provides a general method to build new innovations. 
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Lemma 2.1 (Change of innovations) For every n ^ — 1, let {<fZ}w denote a 
family of permutations of {1, . . . ,r n+1 } , indexed by the elements w of A in , and 
let 

Then (V^) n ^o is a sequence of generating innovations for (X n ) ns g - This means 
that, for every negative integer n ^ — 1, the following properties hold: 

• The random variable V^ +l is uniformly distributed on {1, . . . , r n+1 }. 



The random variable V^ +1 is independent of T*' and therefore also of 
r-wr' 

The random variable X n+ \ is a measurable function of X n and V^ +1 . 



Proof of lemma 2A_ , For every negative integer n and every v such that 1 ^ v ^ 
r n+ i, a simple computation proves that 



n+1 



v\T*> v ] = P[K+i = {^xX»\^n' V ] = Vw 



This shows the first two properties. The third property follows from the fact that 
X n+ i is the fc-th subword of X n , where k = ((fx )~ 1 {V^). □ 



2.2 Canonical word and coupling 

To build the innovations which generate the process (X n ) n ^ , one can use, and 
improve on, Laurent's construction under the stronger condition V. This uses 
the notions of canonical word and canonical coupling, which we recall below. 

Definition 3 (Canonical alphabets and canonical words) For every inte- 
ger M ^ 2, the canonical alphabet on M letters is Am = {1, . . . , M}. Canonical 
words on Am are the words whose i-th letter is congruent to i modulo M. Hence, 
the letters of Am appear in order and are repeated periodically. 

Canonical words will usually be denoted by the letter c. For example the canonical 
word of length 11 on A 3 is 12312312312. 

Notation 1 (General notations) To simplify the definition of the canonical 
coupling, one identifies any ordered alphabet B of size M ^ 2 with Am according 
to the rank of each letter in the alphabet B. 

The i-th letter of a word w is denoted by w{i). Let w = (w(i))i^i^ r denote a word 
of length r. For every 1 ^ i ^ r, H(w, i) denotes the number of instances of the 
letter w(i) among the (i — 1) first letters of w: 

H(w,i) = l{w(j)=w{i)}- 
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Definition 4 (Canonical coupling) Let w denote a word of length r on an 
ordered alphabet B of size M ^ 2. The canonical coupling associated to w is the 
permutation ip w of {1, • • • , r} defined as follows: for every i ^ r, 

(p w (i) = w (i) + H(w,i)M if w(i)+H(w,i)M < r. 

After this process has been applied to every i, one chooses (p w (j) for the integers 
j such that w(j) + H(w,j)M > r, in an increasing way and in order to make (p w 
a bisection. (So <p w (j) is the smallest k which does not belong yet to the range of 

Later on, we apply the notions of canonical word and canonical coupling to some 
alphabets A with £ ^ 1. 



B A 


c 


A D [C|A 


/\ i 


IXX 


A B 


c 


D A A C 


A B 


c 


D A B C 



w 



w 
c 



Figure 1: Example of canonical coupling 
By construction ip w is one of the permutations y? such that ip ■ w := w o tp^ 1 is 



as close as possible to a canonical word. Lemma |2.2| makes this statement more 
precise. 

Lemma 2.2 (Comparison of w and c o ip w ) Let c be the canonical word of 
length r ^ 1 on an ordered alphabet B of size M ^ 2. Then for every 1 ^ i ^ r 
and w G B r , 

c((p w (i)) = w(i) if and only if w(i) + H(w,i)M ^ r. 



Proof of lemma 2.2 By definition of c, the number of instances of j in c is 



Kj = m&x{k ^0:j + (k- l)M < r} = L^^-J + 1 = ^ + L ~ 1 ■ 



M 



If w(i) + H(w, i)M ^ r, then ip w (i) = w(i) + H(w, i)M, thus c(<p w (i)) = w{i) by 
definition of c. 

Otherwise, w(i) + H(w,i)M > r, hence the number H(w,i) of instances of w(i) 
among the i — 1 first letters of w is at least the number K w u\ of instances of the 
letter w(i) in c. According to the first case, ip w sends the ranks of the K w ^ first 
instances of w(i) in w on the ranks of the instances of w(i) in c. Since </> w is 
bijective, the rank of every instance of the letter w(i) in c has an antecedent by 
(p w that is less than %. Therefore the letter in c with rank (p w (i) cannot be w(i), 
that is, c(ip w (i)) ^ w{i). □ 



Lemma 2.2 implies lemma 2.3 below, which is a slight improvement on Laurent's 



lemma 6.3.2 of [8]. 

Lemma 2.3 Let X denote a uniform random word of length r on an ordered al- 
phabet B of size M ^ 2, V a uniform random variable on {1, ... , r} ; independent 
of X , and c the canonical word of length r on B. Then, 

¥[X(V) ^ c(ip x (V))} ^ M/r + 2(M/r) 1/3 . 



Proof of lemma \2J\ Since X(V) ^ M and {X(V) = c(<p x (V))} = {X(V) + 
H{X, V)M ^ r}, 

{X(V) = c(ip x (V))} D {H(X, V)M < r - M}, 

hence, for every positive s, 

{X(V) = c(ip x (V))} D {H(X, V)M < V - 1 + Ms} n {V - 1 + Ms < r - M}. 
Taking complements, one gets 

{X(V) ^ c((p x (V))} C {H(X, V)M > V - 1 + Ms} U {V - 1 + Ms > r - M }, 
which yields 

P[X(V) ^ c(v?x(V r ))] < P[i/(X, V)M > V — 1 + Ms] + P[V - 1 + Ms > r - M]. 
Since r + 1 — V and are both uniform on {1, ... , r}, 

P[V - 1 + Ms > r - M] = F[V < M(s + 1)] < M(s + 1) jr. 
On the other hand, 

F[H(X, V)M > V - 1 + Ms] < P 7) - (V - 1)/M\ > s] . 
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Since V and X are independent, conditionally on V the distribution of H(X, V) is 
binomial Bin(V— 1, 1/M). The conditional expectation of H{X, V) is (V — l)/M 
and the conditional variance of H(X, V) is 

(V - 1)(M - 1)/M 2 < V/M, 

hence the Bienayme-Chebychev inequality yields 

F[H(X,V)M >V-1 + Ms]< E[F]/(Ms 2 ) = (r + l)/(2Ms 2 ) ^ r/(Ms 2 ). 

Finally, 

P[X(V) ^ c(<p x (y))] < M(s + l)/r + r/(Ms 2 ). 
This upper bound for s = (r/M) 2 / 3 implies the statement of the lemma. □ 

2.3 Adapting Laurent's proof 

Let us state a slight improvement on Laurent's result [8, proposition 6.3.3]: 

Theorem 2 If r n ^> N e " as n — >■ — oo, i/ien £/ie natural filtration of (X n , V n ) n ^o 
is of product type. 

Let us introduce notations for the proof. From now on, a word of length i n on 
the alphabet A will often be seen as a word of length r n+ \ on the alphabet A £n+1 . 

Proof of theorem \^ 

• Choice among sub-words: for every x in A 1 "- 1 and v in {1, ... , r n }, denote 
by f n (x,v) = x(v) the v-th letter of a; seen as a word of length r n on the 
alphabet A in . So X n = / n (X n _ 1; V n ). 

• New innovations: let c n _i be the canonical word of length r n on the alphabet 
A e " (for some fixed order on A n ). Let y?x n _i be the canonical coupling 
associated to X n _i seen as a word of length r n on A in . Set = ^x n _i(Ki)- 

• Construction of a sequence (X' n ) n ^ approximating (X n ) n ^ : set 

The key point is to show that 

P[X„ = X' n ]->l as n ->■ -oo 

by bounding above 

PfX^K) ^^^(K))]. 
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Applying lemma 2.3 to X n _i seen as a word of length r n on the alphabet A £n , 
one gets that, 

P[X n ^ X' n ] sC N l -/r n + 2{N l "/r n ) l '\ 
Hence F[X n ^ X' n ) since < r n . 

Define an application : A 1 ™- 1 x {1, . . . ,r n } — > A ln by f' n {x,v') = f n (x,(p~ l (v')). 
Then /^-x, = X n and ■ , K) ° • • • ° /m+i( ■ , C +1 )(X m ) = X n for 
m ^ n ^ 0. Therefore, under the assumption V, one has, 

— > as m — >• — oo. 

This implies the convergence in probability: 

X n = lim f' n { • ,K)°/n-l( • X-l) ° • • • ° /m+l( • ,VL+l)( X 'm) 



m—>—oo 



lim f' n ( ■ ,K)o...o/' ( • ,lC +1 )o/ m ( ■ ^^(cm-i) 



1+1 ; 

vn — ¥ — oo 



and proves that the innovations {V^)k^ n determine the words (X k ) k ^ n and the 
innovations (Vk = ^xjLi(^')) fc ^ n ' Thus, the filtration (Jn ^)„^o is generated 
by the innovations (V^) n ^o- D 



3 Improving on condition V 

3.1 Statement of the main result 

As said before, there is a gap between the conditions V and A under which the 
problem has been solved by Laurent. Our next theorem bridges the gap between 
the two conditions. 

Theorem 3 If the series ^^ln(r n )/£ n diverges (condition then the filtra- 

n 

tion (Jn' V ^)n<o ^ of product type. 

Remark 1 Laurent JEj/ states condition A as the convergence of the series 

ln(r n !) 



E 



n 



The inequalities ^rln(r) ^ ln(r!) ^ rln(r), valid for every r ^ 2, ensure that the 
series 

ln(r n ) ^ ln(r n !) 



<■«<< E 



n-l 

n 
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both converge or both diverge. Hence Laurent's condition A and the condition A 
which we stated in our introduction and used since, are indeed equivalent. 



Condition -A is easy to express, but less handy to prove things. The equivalent 
wording below, which is closer to theorem [2j is more convenient. 

Proposition 3.1 (Rewording of condition -A) Condition A fails if and only 
if there exists a sequence (a n )n^o of nonnegative real numbers and an increasing 
application <p : — N — > — N, such that the following properties hold: 

1. For every n < 0, r<^( n ) ^ jV 2Q ™ £ <M™) . 

2. The series a n diverges. 

n 

Furthermore, when they exist, the sequence (a n )n^o and the application can be 
chosen in such a way that the additional properties below hold: 

3. When n — > —oo, r^( n ) N 2an£ ^ n '> (in particular, r^ n ) — >■ +00^. 

4. The series ^^a 2n diverges. 

n 

5. For every n ^ 0, < a n ^ 1. 

6. For every n ^ —2, the ratio a n £^t n )/ '^(n+i) is an integer. 



The proof of this result can be found in section 3.4 



3.2 Construction of the new innovations 

The construction of the new innovations uses a partial canonical coupling. This 
tool sharpens the canonical coupling that was introduced in section [23 and which 
has to be kept in mind. 



Under the hypotheses of theorem [3j the ratios (r„) ns g are no longer big enough 
for the innovations associated to the canonical coupling to approach the entire 
word X n in only one step. Therefore several steps are necessary to get a good 
information on the word. 

Definition 5 (Partial canonical coupling) Let w be a word of length £r on 
the alphabet A and A G {!,••• ,£} an integer. Denote by w the word extracted 
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from w by splitting w into r sub-words of length £ and keeping only the first X 
letters of each sub-word. In other words, if w = (wi, . . . ,wg r ), then 

W = (Wi, ...,W X , W t +i, W £+X , W( r -1)£+1, • • • , W( r _i) £+A ) 

is the word constituted of the letters Wi such that i = j mod £ with 1 ^ j ^ A. 

Let (fiw be the canonical coupling of w towards the canonical word c of length r 
on the alphabet A x . Since ipu, belongs to & r (the symmetric group on r letters), 
one can apply it to the r sub-words of w of length i. The permutation ipw^ = (pa 
is called the partial canonical coupling of rate X/£ associated to w. 
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B 



A 



-> rBirAircircirA 



w 



<p. 



w 



A 



B 



w' ^~ 



[ATB lCllAf cRy,-™ 
fATBTcTATBl c 



Figure 2: Example of a partial canonical coupling. In the proof, this coupling 
will be considered with £ = £^ 2n ), A = a 2n , r = r^ (2ri ), w = X^ 2n )-i and 



Proof of theorem [3j Assume that ->A holds. Fix <fi and (a n ) n ^o which fulfill 



conditions 1,2,3,4,5 and 6 of proposition 3.1. We now construct new innovations 



(V^) ns go which generate the same filtration than the process X. 

Let us define new innovations {V^k^o as follows. For every k ^ 0, define V{. = 
(f'x™_ 1 (Vk) = fxk-jy^) ^ there exists an integer n (necessarily unique) such that 
k = 4>(2n), and V fc ' = 14 otherwise. 

Lemma 2.3 will be used to show that with probability close to 1, the first a.<int<t>(2n) 
letters of the words X^ 2n ) and those of /^(2n.)(C^,( 2 „)-i, V^ (2n) ) coincide. 



15 



B 



\C [C \\A\ 



4>(2nM 



B II A 


cicia 


X 


x^ 


A B 


cifAirc 



i>(2n)-1 
A lf(2n)-1 



B 



l J-(2n) 



[aTb1c|a[ b1 c t(2n 



j>(2n+1) 



Figure 3: Description of the method. 



3.3 Proof of the main result 

This proof is split into three steps. 



First step: X^n+x) comes from the beginning of X^n) infinitely often 

One focuses on the events 

A n = {X^ 2n +i) comes from the first a 2n ^(2n) letters of X^ 2n )}- 

One computes P[A n ] by counting: the number of possible choices for the innova- 
tions V fc for (f)(2n) + 1 < k < (f)(2n + 1) is 

0(2n+l) 

] j T k = ^(2n)/^(2n+l)- 
fc=</>(2n)+l 

The number of cases such that A n occurs is the number of sub-words of length 
^(2n+i) entirely included in the first a2n^>(2n) letters of X^n)' this number is 
«2n^>(2n)/^0(2n+i) thanks to the additional hypothesis that 
integer. Therefore f[A n ] = a>2 n and the series J2 n ^[Ai] diverges. 

Moreover A n is a (deterministic) function of Vk for <f)(2n) + 1 ^ k ^ 0(2n + 1), 
hence the events A n are independent and the Borel-Cantelli lemma ensures that 
almost surely, A n occurs for infinitely many n. 
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Note that A n G J^^n+i) f° r ever Y n since V£ = Vk for every time k which is not 
one of the integers 0(2n). 



Second step: Use of lemma 2.3 



Our purpose is to prove lemma 3.2 



Lemma 3.2 For every n ^ 0, set lM 2n ) = {1> a 2n^(2n)} an< ^ a word 
C^(2n)-i o/ Zen##i €^(2n)-i = f<t>{2n)^<t>{2n) on A suc h that C^(2n)-\ is the canonical 
word of length r^ 2n ) on the alphabet A a2n W 2 ™) . T/ie probability for X^(2n)(I^{2n)) 
(the first a 2n ^(2n) letters of X^ 2n )) to be the V^ 2n) -th sub-word ofC^ 2 n)-i con- 
verges to 1 as n tends towards — oo. That is to say, 

P [X<f,(2n){h{2n)) = U(2n)(C^2n)-l, V^(2n))(^(2n))] 1, OS 77, ^ -OO. 



Proof of lemma 3.2. Note that X^n^J^n)) is the V^(2 n )-th letter of X^( 2n )_i 
seen as a word of length r^( 2n ) on the alphabet A a2nl ^ 2n ) (where X^ n)-! is built 
from X^( 2n )-i according to definition [5] and the caption of figure 3.2). Hence 



P [^(2n)(^(2n)) - f<t>(2n){C<p(2n)-l, ^(2n)) (^(2n))] 



P 



^(2n)-l(^(2n)) = C^.(2n)-1 (V 7 *^.! (^(2n) ) ) 



2.3 



Lemma 



v4 Q2n ^( 2 ™> provides 



applied to X^( 2n )_i seen as a word of length r^( 2n ) on the alphabet 



P 



^(2n)-l(^(2n)) 7^ U(2n){C(f>(2n)-l, ^(2n))Cfy(2n)) 

< jV a2 ^)/r 0(2n ) + 2(Ar Q2 ^( 2 ")/r^ (2 „)) 1/3 . 



From proposition 3.1, each term converges to 0, hence lemma 3.2 holds 



Third step: Use of the innovations to recover (X n ) n ^ 

Our aim is to show that, for every m ^ 0, X m is a function of the innovations 
(Vfc)fc^ m . Consider once again the events 

A n = {X^ 2n+1 ) comes from the first a 2 n^(2n) letters of X^ 2n )}. 

If the event A n occurs and if 

-X>(2n)(-^(2n)) = /^(2n)(C^(2n)-l, V^(2n))(^(2n))) 
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then 

X<f,(2n+l) = U(2n+1)(', K/>(2n+l)) ° • • • ° U(2n)(', ^(2n))(^(2n)-l)- 

Moreover, since A n depends only on (VL 2n \ +v ■ ■ ■ , V^/ 2n+1 x), it is independent of 

{^(2n)(^(2n)) = /<£(2n) (C^(2n)-1, V $(2n)) (^(2n))}- Tn US 

P[^(2n+1) = /<M2n+l)(-, ^(2n+l)) ° • • • ° U(2n)(^ ^(2n)) (C</>(2n)-l) | v4 n ] 

> P[X^( 2n )(/^(2n)) = f<t>(2n)(C^(2n)-l, ^(2n) ) (^(2n) )] , 



which tends to 1 as n goes to — oo by lemma 3J2 

Some formulas below will be easier to read thanks to the introduction of the 
function g n which associates X n+ i to (X n , V^ +1 ). Namely, for every integer n ^ 0, 
every word x in A in and every integer 1 ^ v ^ r n , define 

g n {x, v) = f n (x, <fx{v)) if n is one of the integers (f>(2k), 
= f n (x, v) otherwise. 

Let X'^n) be the word of length t^n) whose first a 2n £^( 2n ) letters are those of 
the word /0(2n)(C</>(2n)-i, y^n)) an d the °th ers are set to 1. Then, by lemma 



3.2 



P [X^ (2n) (^ (2n) ) = ^ (2n) (^ (2n) )] -^lasn^ -oo. 
For n < m ^ 0, call X' m n the offspring of X'^, 2n s at time m, that is, 

X' m ,n = SW-lO, Vm) ° 9m-%{' , Kn-l) ° • • • ° 5V>(2n)+l(-, V^(2n))P^(2n)-l)' 

Then, 

P [X m 7^ X^JAj] < P [X^ 2n )(I<t>(2n)) 7^ -<^(2n) Cfy(2n) ) | Ai] > 

hence P [X m ^ X^ n K] -> 0, as n goes to — oo. 



Lemma 3.3 below, which will be proved at the end of section 3.4, enables us to 



use Borel-Cantelli's lemma twice. 

Lemma 3.3 Let (a n ) n ^ and (b n ) n ^ denote two bounded sequences of nonnega- 
tive real numbers such that the series b n diverges and such that a n <^ib n . Then 

n 

there exists an increasing application 9 : N — > N such that the series a>o[n) con_ 

n 

verges and the series be( n ) diverges. 
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Continuation of the proof of theorem [3] 

Since the series ^^P(y4 n ) diverges and, for every fixed m ^ 0, 

n 

F[X m j£ X' m JA n ] 0, when n -»■ -oo, 

our last lemma applied to the sequences (PL4 n ]) n ^ m and (P[X m ^ X' mn ; A n ]) n ^ m 
provides a deterministic increasing application 9 : — N — > — N such that 

^ P [^n ^ X»,0(n)5 ^9(n)] < oo and ^ P [A e(n) ] = oo. 



By Borel-Cantelli's lemma, the events {X m ^ fl A e(n ) occur only for a 

finite number of times n, whereas the independent events Ag^ occur infinitely 
often. Thus for every word x in A lm , almost surely, 

{X m = x} = limsupv4 0(n ) n {X' mfi{n) = x}. 

n— >— oo 

This proves that {X m = x} belongs to J 7 ^', hence X m is a function of the 
innovations (V^J TO ^o- D 



3.4 Proof of some auxiliary facts 



Proof of proposition 3.1, Assume that (a n )n^o and exist such that 1 and 2 hold. 



Then, for every n ^ 0, r^( n ) ^ jV 2a "^( n ) hence log iV r^( ri ) ^ 2a n £^ n ). Therefore 

logjy r w > ^ logjy r 0(w ) > 

n n ^ K ' n 

and the last series diverges hence condition -iA holds. 

Conversely, assume that condition -iA holds. Let (5 n = \ log N (r n )/£ n . Then, 
r n = ATW« hence r n /N 2 ^ = N 2 ^ 1 ". 

Since ^^Ai diverges and \n\2 n converges, there exists an increasing applica- 
nt n^O 

tion <p such that the series ^_,^(n) diverges and such that ^ 2^ n ^\(j)(n) \ for 

n 

every n ^ 0. 

Replacing, if necessary, (j> by (p given by <p(n) = <j)(n — 1), one can ensure that 
the series P<j>(2n) diverges as well. 

n 

Since r n ^ 2 for every n, i^n) ^ 2~* (n ), hence ^ 2^w|0(n)| implies that 
^{n)^(n) ~> ^ M- Hence, 

r^/iV 2 ^")^") = jV 2 ^")^' 1 ) ^ iV 2 ' n ' ^ 1, 
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and this sequence converges to +00. Furthermore, defining a n = min(/3^( n ), 1), 
one sees that (a n ) n fulfills conditions 1-2-4-5 (condition 3 being a consequence of 
condition 4). 

We now show how to build from (a n ) n a sequence (a' n ) n such that condition 6 
holds as well. 

Recall that from the construction above, r n = jV 4/3n£ ™ and ^{ n )^<t>{n) ^ I0( n )l> 
hence r Hn) ^ N^)\ ^ 2~^ n \ Also a n ^ 2^ n )|0(n)|. 

These inequalities implies that the ratio g n = a n ^(n)/^0(n+i) is such that 

k=<f>(n)+l J 

^ 2^|0(n)| 2^ n+1 ^^ n ^ 1 2~ 4 ^ n+1 ^ 
= 2- 3 ^ n+1 )~ 1 |0HI- 

Since <fi(n + 1) ^ —1 and |</>(^)| ^ 2 for every n ^ —2, this shows that g n ^ 8 for 
every n ^ —2. 

Thus, Sg n /9 ^ [g n \ ^ g n and the sequence (a' n ) n defined by 

, _ L a "^(ri)/^(n+l)J 



n)/*<^(n+l) 



is such that ^a n ^ a' n ^ a n for every ri ^ — 2. Therefore it fulfills the condi- 
tions already satisfied by the sequence (a n ) n ^o an d the additional condition that 
Qn^(n)/^(«+i) i s an integer for every n ^ 0. □ 



Proof of lemma 3.3. Call B any finite upper bound of the sequence (6 n ) n£ N- Since 
On *C & n , for any positive integer there exists an integer iV^ such that for every 
n ^ iVfc, a n ^ b n 2~ k . We now define a sequence of disjoint intervals of integers 
Jk — {ik-, ■ ■ ■ ,jk}, as follows. Set j_i = —1 and let k ^ 0. 

Once jk-i is defined, let %k = max{j fc _! + 1, N k }. Since the series b n diverges 

n 

and ^ b n ^ B for every n e N, one can choose an integer is z fc such 

3k 

that i? ^ 6 n < 2.B and let Ik = {i^, . . . ,jk}- Note that b n ^ B and 
a n < 25/2 fe . 



Calling Q the set Ik, one gets 
fceN 

5Z a « = Yl Yl a " < 25^1/2 fc = AB, 



n£Q fceN ne Jfc AiGN 
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and _ 

neQ fceN n&Jk fceN 

which completes the proof of the lemma. □ 

4 Proof of non standardness under A 

The non standardness of the split-word process was established by Laurent in his 
thesis [8] and a similar result was obtained by Vershik in pU] in the context of 
decreasing sequences of measurable partitions. In this section we give a simplified 
presentation of Laurent's proof. 

The proof of this result involves a subtle notion on nitrations, which is standard- 
ness. The notion of standardness was first introduced by Vershik for decreasing 
measurable partitions. This notion has been adapted to continuous time nitra- 
tions by Tsirelson pQ, it has been formulated by Dubins, Feldman, Smorodinsky 
and Tsirelson [I] for continuous time nitrations and by Emery and Schacher- 
mayer [5] for discrete time nitrations. 

Many necessary and sufficient conditions for standardness have been established, 
for instance Vershik's self-joining criterion and various notions of cosiness. All 
these criterions are based on coupling methods. Checking them in specific cases 
is often a technical task. Yet, these criterions are the key tool to solve some 
difficult problems. For example, Tsirelson defines and uses a notion of cosiness 
to prove that the filtration of Walsh's Brownian motion is not Brownian since it 
is non standard. 

By definition, a filtration J 7 = (J r n ) n ^o indexed by n ^ is standard if, modulo an 
enlargement of the probability space, one can immerse J 7 in a filtration generated 
by an i.i.d. process. Recall that a filtration T = (-7 r n ) ns go is immersed in a filtration 
Q = (Gn)n^o if, for every n ^ 0, T n C Q n and T n and Q n -\ are independent 
conditionally on J-" n _i. Roughly speaking, this means that Q n -\ gives no further 
information on J-" n than Fn-i does. Equivalently, J 7 is immersed in Q if every 
.F-martingale is a ^-martingale. 

Laurent negates the so-called I-cosiness property to prove that under A, the 
filtration is non standard and therefore non of product type. 

We follow the method that Smorodinsky [6] used to prove the non-existence of 
a "generating parametrization" in the case where r n = 2 for every n ^ 0. This 
method still works in the general case and provides the non standard behaviour 
of the filtration. 

The purpose of this section is to show that if the sequence (r n ) n ^ is A, then the 
filtration is non standard. Note that A holds for every bounded sequence 
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4.1 Preliminary notions 

We first recall the notion of I-cosiness, due to Emery and Schachermayer [5] . 

Definition 6 (Immersion and co-immersion) Let J 7 = (J r n ) n <jo an d Q = 
(Qn)n^o denote two filtrations defined on the same probability space. 

• J 7 is immersed in Q if every martingale in J 7 is a martingale in Q . 

• J 7 and Q are co-immersed if J 7 and Q are both immersed in J 77 V Q . 

Definition 7 (I-cosiness) Let J 7 be a filtration on a probability space (Q,A, P). 
One says that J 7 satisfies the I-cosiness criterion if for every random variable Y 
measurable for To with values in a finite set and for every real 5 > 0, there exists 
a probability space (O, A, P) and two filtrations J 77 ' and J 77 " on (Q, A, P) ; such that 
the following properties hold. 

• The filtrations J 77 ' and J 77 " are both isomorphic to J 77 . 

• The filtrations J 77 ' and J 77 " are co-immersed. 

• There exists an integer n such that JF 1 and T'^ are independent. 

• The copies Y' and Y" ofY by the isomorphisms of the first condition, verify 
f[Y' ^ Y"\ <8. 

The proof of the non standardness of the filtration J 77 ^'^ uses the easy part of 
the equivalence between I-cosiness and standardness. 

Theorem 4 (Corollary 5 [5j) A filtration is standard if and only if it satisfies 
the I-cosiness criterion and is essentially separable. 

To prove the non standardness of J 7 ^'^, it is therefore sufficient to show that 
J^( X ' V ) does not satisfies the I-cosiness criterion. The tools of the proof are intro- 
duced just below. 

Let no be a negative integer which will be fixed later. 

Definition 8 (Definition of Aut n ) The intervals of integers {1, . . . ,£k}, 
{£k + !)•••) 2£/t}, . . . , {i n — + • • • > C} are called blocks of {1, ... , £ n } of length 
£k. Every permutation of the £ n /£ no blocks of length £ m which induces for every 
k G {n, . . . ,no} a bisection between the blocks of length 1^ is called an automor- 
phism of {1, . . . ,£ n } adapted to {r n+ i, r„ }. One denotes by Aut n the set of 
those permutations. 
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One can enumerate the automorphisms adapted to (r n ) nsS o, by induction, as fol- 
lows. By definition, an automorphism a in Aut n _i is built from a permutation o 
of {1, . . . ,r n } and from r n automorphisms {ak)isik^r n i n Aut„. One gets a from 
o and (afc)i^fcs;r n by setting, for every 1 ^ j ^ r n and every 1 ^ k < £ n , 

a((j - 1)4 + k) — (a(j) - 1)4 + 

Therefore #(Aut n _i) = #(6 r J(#(Aut„)) r " = r n !(#(Aut„)) r «. By induction 

n n 

#(Aut n ) = J] (r fc !) r "+— 1 = J] (r fc !)^-. 

fc=n+l fc=n+l 

Note that 

n ° lnr ! 

#(Aut n ) = exp(£ n S n ) where S n = — — . 

fc=n+l 

We now define a semi-metrics based on the Hamming distance. 

Definition 9 (Semi-metrics e n on A £n ) Recall that the Hamming distance on 
A ln is defined by 

fta/) = #{fc6{l,..,M : x(k)^x'(k)}. 

One defines an action of the group Aut n on A ln , seen as (A en o) en / en o ; by 

a ■ x = x o a -1 , 

and a semi-metrics e n on A ln , by 

e n (x, x') = — min{d^(a ■ x, x') ; a G Aut n }. 

The quantity e ra (a;,a; / ) is the smallest proportion of letters which are different 
between the words x' and x o a^ 1 as a goes through Aut n . Our next result is a 
recursion relation, useful to compute e n . 

Lemma 4.1 For every n ^ 0, x = (wi, . . . , w Tn ) and x' = (w[, . . . , w' r ) where 
Wi and w'j belong to A £n , 

e„_i(x, x') = inin ( — jf^ e n (w j , u£ (j) ) I . 

f7eto '-n-l yn j=1 J 
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Proof of lemma 4-1 



Using the decomposition of every automorphism a of Aut n _i into a permutation 
a of {1, ... , r n } and r n automorphisms a^, 1 ^ k ^ r n in Aut n , and the additivity 
of the restricted Hamming distance, one sees that £ n _ie n _i(x, x') is the minimum 
over these a and a& of the sums 



hence 



f n _ie n _i(x, x') = min [ i n ^ e„(wj, it;^) 



□ 



To prove that the filtration JF^ X ' V ^> is non standard under the assumption A, by 
denying the I-cosiness criterion, one considers X' and X" two copies of (X n ) n ^ Q 
such that: 

• The associated nitrations J 7 ' = T^ x ^ and J 7 " = T^ x '^ are co-immersed; 

• There exists an integer m < n such that (X' k )k^ m and (X%)^ m are inde- 
pendent. 

Our proof of the non standardness of J r ^ x ' v ' > includes three steps: 

. We prove the inequality F[X' m ^ X'> ] > E[e n {X' n , X%)]. 

• We bound below E[e n (X' n , X")} when X' n and X" are independent. 

• We negate the I-cosiness criterion. 

4.2 Proof of the inequality F[X' no ^ X'^] ^ E[e n (X' n , X'fi] for 
n ^ no 



For n ^ n one denotes by 

M n = F[X' no ± Xl\F' n V r n \ L n = e n {X' w Xl). 

By construction, (M n ) nsSno is a martingale. Loosely speaking, this martingale 
measures the influence of the past before time n on the word at time n Q . The 
key step is to prove that (M n ) nsSno is bounded below by (L n ) n ^ no . 
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Let us prove that (-^n)n^no is a sub-martingale in the filtration J- 1 V J~". Let 
n ^ n . Using the co-immersion of the nitrations J 7 ' and J 7 ", the conditional law 
of X' n given J-' n -\ V J-" n -\ is uniform on the r n sub-words of X'^-y of length £ n : 



£{K\F'n-i v JVi) = — E 



i), 

i=l 



where f n (x,v) denotes the v-th sub-word (of length £ n ) of x, for x G A^™- 1 and 
1 ^ v ^ r n (as in the proof of theorem |2|). Furthermore, 



1 Tn 

£(^nl^n-lVA-l) = — ^ <V„«-i>i)" 

Tn 3=1 

Therefore, the conditional law of (X' n ,X'^) given J-' n -\ V J-" n -\ can be written 
as follows 

where (C*j)i<ij<r„ is a bistochastic matrix measurable for J 7 ^ V ^-l- m P ar ~ 
ticular, 

S[L n |J' n _ 1 VJ" n _ 1 ] = - ]T C^e^UX'^.UX'^j)). 

This quantity is the image of the matrix {Cij)i^ij^ rn by a linear form. Since 
bistochastic matrices belong to the convex hull of the permutation matrices, one 
gets 



E\L n \T T n _ x \> - inf J2 e n(fn(X' n _ 1 , i ),f n (X^ 1 ,a( i ))), 

hence 

ElLnlJ 7 n -i V J 7 n _i] ^ e n -\(X / n _ 1 , X n _ x ) = L n _i, 

thanks to the recursion relation verified by the semi-metrics e ra . This shows that 
(L n ) n ^ no is a sub-martingale in the filtration J 7 ' V T" . 

Since M no = l{x^x^} > L no , one gets, for every n < n 

yielding the inequality P[X^ o ^ X^ o ] ^ E[e n (X' n ,X")} by taking the expectations. 

This inequality is going to be used to prove that the probability PpC Q ^ X"J can 
not be made as small as one wishes if the processes X' and X" are independent 
until a given time n. 
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4.3 Bounding below E[e n (X' n , X")] when X' n and X" are in- 
dependent 

The purpose of this subsection is to prove the next inequality: 

Lemma 4.2 Let (X£,)^ and (X^)^ be two copies of the split-word process that 
are independent until time n. Then, for every a > and n ^ n$, 



P [e n (X' n ,X^) ^ 1 - N- 1 - a] < exp (£ n (S n - 2a 2 )) , where S n = 



k=n+l 



'-k-1 



The proof is based on Hoeffding's large deviations inequality, see [9] for a proof 
of this inequality. 

Lemma 4.3 (Hoeffding) Let q in ]0, 1[ and {tk)i^k^n denote n independent 
Bernoulli random variables of parameter q, and Z n their average. Then, for 
every real a > 0, 

P [Z n ^ q - a] ^ exp(-2na 2 ). 



Proof of lemma ^2 One applies lemma 4.3 to the variables 



4XiW^(»)}' 



1 ^ i ^ 



Since X' n and X" are independent and uniform on A £n , the €i are independent 
and Bernoulli with parameter q^ = 1 — 1/N. Thus, denoting by the Hamming 
distance on A £n , 



P 



1 



d%(X' n , XZ) ^ q N - a 



sC exp(-2£ n a 2 ). 



For every a in Aut n , a ■ X" has the same law as X" and is independent of X '. 
Since, by definition e n (X ; n , X") = min{<i^(X^, a ■ X") ; a E Aut n }, one gets 



[e n (X' n ,X f :)^q N -a] 



3a e Aut n , -< (X;, a ■ X'£) ^ q N - a 



hence 



F[e n (X' n ,X';)^q N -a} ^ £ P 

aGAut n 

= «(Aut n )P 



d n {X'n, a ■ X n) ^QN -a 



d»(X' n ,XZ)^q N -a 



< exp(£ n S n ) exp(-2£ n a 2 ), 



since (j(Aut n ) = exp(£ n 5' r , 



□ 
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Choice of the integer no Under assumption A, by remark 1 after theorem [3j 
one can choose no ^ such that 

< 2(1 -1/A0 2 , where S.^ = ]T ln(a!) 



-fc-i 

k=— oo 



Once n is fixed, choose a real a such that \J S-^ j 2 < a < 1 — 1/N. 

Since S n — 2a 2 ^ SLoo — 2a 2 < and £ n ^ £ rao for every n ^ n , lemma 4.2 yields 

Pfe^^Xl-l/JV-a] < exp (£ n (S n - 2a 2 )) 

^ exp (^oOSLoc -2a 2 )) = /?, 

with p < 1. Hence, £[e n (X;, X^')] > (1 - £) (1 - 1/iV - a), which is positive. 



4.4 Negation of the I-cosiness criterion 

If (X', V) and (X", V") are two copies of the split-word processus whose nitra- 
tions are co- immersed and independent until time n ^ no, 

px; o + x;'j ^ E[e n {x' n ,x'i)\ - i/x - «), 

and this lower bound is positive, contradicting the I-cosiness criterion. Thus the 
filtration of the split-word process is non standard. □ 
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